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Abstract 

In this paper a special piecewise linear system is studied. It is shown that, under a mild assump¬ 
tion, the semi-smooth Newton method applied to this system is well defined and the method 
generates a sequence that converges linearly to a solution. Besides, we also show that the gen¬ 
erated sequence is bounded, for any starting point, and a formula for any accumulation point 
of this sequence is presented. As an application, we study the convex quadratic programming 
problem under positive constraints. The numerical results suggest that the semi-smooth Newton 
method achieves accurate solutions to large scale problems in few iterations. 
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1 Introduction 

In this paper we consider the following special piecewise linear system: 

+ Tx = b, (1) 

where, denoting by the set of nxn matrices with real entries and the n-dimensional 

Euclidean space, the data consists of 6 a vector in M”, T a nonsingular matrix in the variable 

a; is a vector in M” and x^ is the vector in M” with f-th component equal to = max{xj,0}. 

In [7] was proposed a semi-smooth Newton’s method for solving Under suitable assumption 
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was showed the finite convergence to a solution of ([T|). Some works dealing with ([T]) and its gener¬ 
alizations include [7ti9l ll2l[^l39j . It is worth mentioning that a similar equation has been studied 
in [27]. 

The purpose of the present paper is to discuss the semi-smooth Newton’s method introduced 
in [7], to solve ([H), under new assumptions. As an application, we use the obtained results to study 
the remarkable instance of ([T|), 

[Q - I] x’*' -I- X = -b, (2) 

where the data consists of Q a positive definite real matrix of size n x n and b G M”. Moreover, we 
present some computational experiments designed to investigate its practical viability. It is worth 
pointing out that the semi-smooth Newton’s method for solving ([2|) was studied in m and some 
computational tests were presented in |3|. The results obtained in this paper improve the ones 
of [T7] . As we will show, the system ([2|) arises from the optimality condition of the convex quadratic 
programming problem under a positive constraint. 


Minimize —x^Qx + x~^b + c 
subject to X G 


(3) 


where c is a real number and M” is the nonnegative orthant. Note that, without loss of generality, 
we can assume Q symmetric in ([3]) because the objective function of (l3|) is equal to ^x~^Qx + x~^b+c, 
where Q = ^{Q + Q'^) is a symmetric matrix. Positively constrained convex quadratic programming 
is equivalent to the problem of projecting the point onto a simplicial cone. The interest in the subject 
of projection arises in several situations, having a wide range of applications in pure and applied 
mathematics such as Convex Analysis (see e.g., m), Optimization (see e.g., [^10llllll37j i. Numerical 
Linear Algebra (see e.g., [38]), Statistics (see e.g., piisiisg), Computer Graphics (see e.g., [T8] l 
and Ordered Vector Spaces (see e.g., [T ] l23 p 24 p 32 ( [33]L The projection onto a general simplicial cone 
is difficult and computationally expensive, this problem has been studied e.g., in [2l [T6l[T^l30l[3T] . 
It is a special convex quadratic program and its KKT optimality conditions consists in a linear 
complementarity problem (TCP) associated with it, see e.g., [291130] . Therefore, the problem of 
projecting onto simplicial cones can be solved by active set methods [5l|25l|26l[29] or any algorithms 
for solving LCPs, see e.g., [51129] and special methods based on its geometry, see e.g., [291130] . 
Other fashionable ways to solve this problem are based on the classical von Neumann algorithm 
(see e.g., Dykstra algorithm [masi]). Nevertheless, these methods are also quite expensive (see 
the numerical results in [28] and the remark preceding Section 6.3 in [40j). 

Following the ideas of m, we show that the approach using semi-smooth Newton’s method, for 
solving (|3]), has potential advantages over existing methods. The main advantage appears to be the 
global, linear convergence and to achieve accurate solutions of large scale problems in few iterations. 
Our numerical results suggest, for a given class of problem, that the number of required iterations 
is almost unchanged. The numerical results also indicate a remarkable robustness with respect to 
the starting point. 

The organization of the paper is as follows. In Section Fl. 11 some notations and preliminaries used 
in the paper are presented. In Section [2] we study the convergence properties of the semi-smooth 
Newton’s method for solving ([T]). In Section [3| the results of Section [2] are applied to find a solution 
of dSj). In SectionUJwe present some computational tests. Some hnal remarks are made in Section[5l 
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1.1 Notations and preliminaries 


In this subsection we present the notations and some auxiliary results used throughout the paper. 
Let M” be the n-dimensional Euclidean space with the canonical inner product (•, •) and induced 
norm || • ||. The i-th component of a vector x € M” is denoted by Xi . We use the partial ordering 
for vectors, defined by x < y meaning Xi < yi, for all i = For x G R”, sgn(x) will 

denote a vector with components equal to 1, 0 or —1 depending on whether the corresponding 
component of the vector x is positive, zero or negative. If a G R and x G R”, then denote 
a"*" := max{a, 0}, a~ := max{—a, 0} and x"*" and x~ the vectors with i-th component equal to (xj)"*' 
and (xj)“, respectively. Prom the definitions of x"*" and x~ we have x = x"*" — x“, (x'’',x“) = 0 and 
x+j x“ G R” . 

Lemma 1. Letx,y£W^. T/ien Hy"''— x"''— diag(sgn(x"''))(y — x)|| < ||?/— x||. 

Proof. For each i G {1,..., n}, we have two possibilities: 

(a) Xi < 0. In this case, sgn(x+) = 0. Thus, |y+ - xf - sgn{xf){yi - Xj)| = |y+| < \yi - Xj|. 

(b) Xi > 0. In this case, sgn(x+) = 1. Hence, \yf - xf - sgn{xf){yi - Xi)\ = \yf - yf < \yi - xf. 

Combining (a) and (b) we have {yf — xf — sgn{xf){yi — x,))^ < (y* — x,)^, for alH = 1,..., n, which 
implies the desired inequality. □ 

The matrix I G R"^” denotes the identity matrix. If x G R"' then diag(x) G will denote a 

diagonal matrix with (f,f)-th entry equal to Xj, f = 1,..., n. Denote ||M|| := max{||Mx|| : x G 
R"", ||x|| = 1} for any M G R"'^"'. The next useful result was proved in 2.1.1, page 32 of [^ . 

Lemma 2. Let E G R”^". If ||F|| < 1, then E — 1 is invertible and ||(F — I)~^|| < 1/ (1 — ||F||) ■ 

We end this section with the contraction mapping principle (see 8.2.2, page 153 of [S])- 

Theorem 1 (contraction mapping principle). Let f : R” —>■ R”. Suppose that there exists A G [0,1) 
such that ||</>(y) — <?i(x)|| < A||y — x||, for all x,y G R"'. Then there exists a unique x G R" sueh that 
4>{x) = X. 

2 A semi-smooth Newton method for a piecewise linear systems 

In this section we present and analyze the semi-smooth Newton’s method for solving ([1]). We begin 
with an existence result of solution to the equation (fT|). 

Proposition 1. Let A G R. If ||T"~^|| < A < 1 then dll) has unique solution for any 6 G R”. 

Proof. The equation ([T|) has a solution if only if (t){x) = —T~^x'^ + T~^h has a fixed point. It follows 
from definition of 4> that 


4>{y) - 4>{x) =-T ^{y^ - x^), x,yGR”. 

Since ||T~^|| < A < 1, the last equality implies that \\4>{y) — <?i(x)|| < A||y — x||, for all x,y G R”. 
Hence (p is a, contraction. Therefore applying Theorem [1] we conclude that (p has precisely a unique 
fixed point and consequently ([T]) has a unique solution. □ 
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Then the assumption ||T“^|| < 1 in Proposition [T] is sufficient to the uniqueness of solution of 
([1]). The next example shows that it is not possible to increase the upper bound of and still 

ensure the uniqueness of solution in ([T]). 

Example 1. Consider the function F : ^ defined by F{x) = + Tx — b, where 


T = 


-1 

0 


0 

1 ’ 


b = 


0 

2 


Note that \\T ^|| = 1 and there holds F{x*) = F{x**) = 0, where x* = [1, and x** = [0,1]^. 


The semi-smooth Newton method introduced in |36] for finding the zero of the function 

F{x) := + Tx — 6, x € M"", (4) 

with starting point x^ G M”, it is formally defined by 

F{x^) + V^ {x^+^-x^'^ V’^€dF{x'^), A: = 0,1,..., (5) 

where is any subgradient in dF{x^) the Clarke generalized Jacobian of F at x^ (see Defini¬ 
tion 2.6.1 on page 70 of [l3]). Letting 


P(x) := diag(sgn(x^)), x G M”, 

it easy to see that 

P(x)+ r G 5F(x), xGM*". 

Since P{x)x = x"'' for all x G M”, taking = P(x^) -|- T, equation ([5]) becomes 

P(x^) + rl x^+^ = 6, A: = 0,1,..., 


( 6 ) 


(7) 


which dehne formally the semi-smooth Newton sequence {x^} for solving (HD- Note that the above 
iteration is exactly the one stated in equation (6) of [7]. We devote the rest of this section to 
studying the convergence properties of this sequence. 

Proposition 2. Assume that the matrix P{x) -\-T is nonsingular for all x G M”. Then, {x^} is 
well defined and bounded from any starting point. Moreover, for each accumulation point x of {x^} 
there exists an x G M” such that 

[P{x) + T]x = b. (8) 

In particular, if sgn{x^) = sgn{x~^), then x is a solution of (fT|). 


Proof. To prove this result we follow similar arguments of Proposition 3 of [27]. □ 

The next proposition gives a condition for the Newton iteration ([7|) to finish in a finite number of 
steps, which can be proved by using the same argument as the one used in the proof of Lemma 3 

of [7|. 

Proposition 3. If in (j7|) it happens that sgn{{x^~^^)^) = sgn{{x^)~^), then x^"*"^ is a solution of 

m- 

Next, we state and prove a theorem for the semi-smooth Newton’s method ([7|) for solving (fT]l. 
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Theorem 2. Let b € R” and T G he a nonsingular matrix. Assume that ||T ^|| <1. Then, 

for any starting point G R", {x^} is well-defined. Additionally, if 

||r-i||<i/2, (9) 

then {x^} converges Q-linearly to x* G R”, the unique solution of as follows 


llx* - < 


n-ll 


1 - IIT 


-II 


\x — X 


A: = 0,1,. 


( 10 ) 


Proof. Let x G R”. Since ||T ^|| < 1, the definition of P{x) implies ||T ^P(x)|| < ||T ^|| < 1. 
Thus, Lemma [2] implies that —T~^P{x) — I is nonsingular. Because T is nonsingular and 

P{x) + T = -T[-T-^P{x)-l], xGR”, 

we conclude that P{x) + T is also nonsingular. Hence, for any starting point x® G R”, © implies 
that {x^} is well-defined . 

Using Proposition [ll we conclude that ([T|) has a unique solution x* G R"'. Since x* G R” is the 
solution of ©, we have [T’(x*) + T]x* — 6 = 0, which together with definition of {x^} in ([7D and 
(|6|l implies 

3.*_xfc+i = [[p(^*)+2"]x*-6-[P(x^)+r]x^+6-[P(x^)+T](x*-x*^)], A: = 0,1,... . 

On the other hand, since P{x)x = x"*" for all x G R”, after simple algebraic manipulations we obtain 
[P(x*) + r]x* - 6 - [P(x^) + T]x^ + 6 - [P(x'=) + r](x* - x^) = (x*)+ - (x^)+ - P{x^){x* - x^), 


for A: = 0,1,.... Combining the two above equalities and using properties of the norm we have 


X — X 


fc+i| 


< 


P(x'') + T]-^ (x*)+ - (x'')+ - P(x^)(x* - x'') 


A: = 0,1,.... 


It follows from Lemma[I]that ||(x*)''' — (x*^)"'' — P{x^){x* — x^)|| < ||x* — x^||, for A: = 0,1,..., and 
the last inequality becomes 


\x*-x^+^\\ < 


[Pix^) + T] 


-1 


|x*-x^| 


A: = 0,1,.... 


( 11 ) 


On the other hand, after some algebra and using properties of the norm, we have 


[P(x^) + T] 


-1 


[-r-ip(x^) -1]-^ [-T-^] < [T-^P{x'^) + I] 


-1 


P-^W, A: = 0,1, 


which combined with Lemma [2] and considering that ||r ^P(x^)|| < ||r ^|| <1, implies 

|r"M| 


[P(x'') + T] 


-1 


< 


i-||r 


-ii 


A: = 0,1,... . 


Thus, last inequality together with (fTH gives (fTOl) . Assumption Q implies ||r“^||/(l — ||r“^||) < 1. 
Therefore, (HOI) implies that {x^} converges Q-linearly, from any starting point x^, to the solution 
X* of (HI). Hence the theorem is proven. □ 


For stating the next result we need the following definition. Let S := {sij) G R”^” be with i— th 
row Si := {sii ,..., Sin)'^, i = 1,..., n. We say that S has , if for each i—th row s, either Sj > 0 or 

Si < 0 . 
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Example 2. The following three matrices have its rows with definite sign: 


■-2 -3 -1 


'2 3 1' 


■-2 -3 -T 

1 1 2 

? 

112 


-1 -1 -2 

5 2 1 


5 2 1 


-5 -2 -1 


Example 3. It follows from jlSSl Theorem 2] that, if A ^ is a non-singular M-matrix then 

{A + D)~^ > 0, for each diagonal matrix D G with D > 0. In particular, if A is an 

M-matrix, then {A + D)~^ has its rows with definite sign, for each D > 0. 

Theorem 3. Assume that © has solutions. If [-P(x) + r] ^ exists and have its rows with definite 
sign, for all x G M”. Then {x^} generated by © converges after finite steps for the unique solution 

of ©. 


Proof First of all note that the sequence generated by ([7|) satisfies 

F(x^) + [P(x^) + r](x*^+i-x*^) = 0, A: = 0,1,..., (12) 

where the function F is defined in Q. By direct computation, we have 

F{y) - F{x) - [P{x)+T]{y - x) = P{y)y - P{x)y >0, x,y (13) 

For arbitrary x^ G M”, the above inequality and (1121) imply that 

F{x^) > F(x°) + [P(x°) + r](x^ - x°) = 0. 

Thus, applying an induction argument we conclude that 

F{x’^) = [P{x'^)+T]x’^-b>0, A: = 1,2,.... (14) 

Let X* be a solution of (JH). Letting y = x* and x = in (fT3|) . we obtain 

0 = F{x*) > P(x^) + [P(x*^) + T]{x* - x^). (15) 

Since Si = {sn, ..., Sin)^ , the i— th row of [P(x) + r]“^ =: (s^), has all elements either non-negative 
or non-positive, we have only two options: sgn(s^) has its components equal to —1 or 0, or sgn(s^) 
has its components equal 0 or 1. Multiplying both sides of (fTHI) by [P(x^) -|- T]~^ and using (fT4)l . 
we have 


X* < Xi - SiP(x^) <Xi, i e 1+ := {l < i < n : sgn(sf) G {0, 1}} , (16) 

for all A: > 1, and similarly 

x* > x^ — SiF{x^) > xf, f G /_ := {l < f < n : sgn(sf) G {—1,0}} . (17) 

Note that as \T + P(x^)]“^ exists, then there are no indexes i and j such that Sj = Sj, thus 
/+ n /_ = 0 and /_|_ U /_ = {1, 2 ... , n}. It follows from ([5]), ([7]) and = P(x^) -|- T that 

^k+i = + P(x^)]-i6 = x^ - [P(x'=) + r]-^P(x^), A: = 0,1,.... 


Therefore, using ()14p and the definition of /+, we obtain 




i e U 


(18) 
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where the first inequality above follows from (I16p . and analogously using (1171) . we have 

X* > > xf, iel.. (19) 

Hence, {x^} converges, because {x^} is monotone and bounded by x* for i = 1,... , n. Thus, {xf} 
has a limit Ui. Therefore, {x^} converges to the vector u with components Ui. By using again (fT^ . 
we have 

||F(u)||= lim ||F(x^)||= lim ||[P(x^) + r](x^+^ - x^)|| < (1 + ||r||) lim ||x^+^ - x^|| = 0. 

k^oo k^oo k^oo 

Therefore, u is a solution. Furthermore, for any two solutions x* and y*, (|13ll implies 

0 = F{x*) - F{y*) >[T + P{y*)]{x* - y*)- (20) 

Then, multiplying by [T + P[y*)]~^ we obtain 

yl >x* ie 1+ and y* < x* i€ 

The result follows by reversing the roles of x* and y* in ()20p . Thus, the problem has a unique 
solution equal to the limit of the sequence {x^} generated by (ffp. 

Finally we establish the finite termination of the sequence {x^} at the unique solution of problem 
m, which will be denoted by x*. Since for all x G M” P{x) has at most 2"' different choices, 
then there exist j,£ G N with 1 < .^ < 2"’ such that P{x^) = P{x^^^). Note that ii I = 1, then 
Proposition [3] implies that is solution of ([T]). This statement implies that 

= [r + P{x^)]-^h =[T + P{x^+^)]-^b = x^'+^+^ 


Applying inductively this argument, 

Jb - Jb ^ tjb - ijU ^ ^ tjb - tJb ^ Jb - Jb - Jb • 

Thus, the sequence {x^} generated by ([7]) has at most j + i different elements. Now using (fTsp and 
cni), we obtain 


x^^ > x^+^ 


> 


> rJ+^+~^ — rJ + ^ 
— Xj — Xj , 


G 1 . 


+ > 


i G 


and 

x\ <x\ <■■■ <X\ = x\ , 

Hence, x^^^ = x-^^^ and in view Proposition [3] we conclude that is solution of ([T|), i.e., 

x^+2 = x*. □ 


It is worth mentioning that Theorem [3] generalizes Theorem 2 of [7], in the special case [P(x) + 
T]~^ > 0, for all x G M"’. The invertibility of P(x) + T, for all x G M”, is sufficient to the well- 
definedness of the semi-smooth Newton method. However, the next example show that, for the 
convergence of these methods, an additional condition on T must be assumed, for instance, ([9]) or 
[P(x) -|- T]~^ exists with its rows having dehnite sign, for all x G M”. 

Example 4. Consider the function P : ^ defined by P(x) = x^ -|- Tx — 5, where 


T = 


-2 

-1 


3 

1 ’ 



Note that ||T'~^|| = 3,86..., the matrix P(x) -|- P is invertible and have no rows with definite sign, 
for all x G M^. Moreover, F has x* = [2,-1]'^ as the unique zero. Applying semi-smooth Newton 
method starting with x*^ = [—3,3]^, for finding the zero of F, the generated sequence oscillates 
between the points 
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3 Application to quadratic programming 


In this section, we apply the results of Section [2] to solve ([2]), in order to hnd a solution of ([3]). We 
begin showing that, from each solution of ([2|) we obtain a solution of ([3]). From now on we assume 
that Q is a symmetric and positive dehnite matrix. 

Proposition 4. If the vector x* is a solution of ([2]), then (x*)"*' is a solution of ([3]). 


Proof. The optimality conditions of the problem in ([3|) are given by 

x G , Qx + bGM'f, (^Qx+ b,x'^ = 0. (21) 

We claim that (x*)+ is a solution of (1^ . We know that (x*)+ — x* = {x*)~. Thus, if x* G M” is a 
solution of ([2]), then 

Q{x*)+ + b={x*)-. 

Hence, by using {x*)~ G M" and {{x*)~, (x*)"'') = 0, the last equality easily implies that 

g(x*)+ + 6GM!(, {Q{x*)+ + b,ix*)+) =0. 

Combining this with (x*)"*" G M”, we conclude that (x*)"*" is a solution of (j21l) as claimed, which 
completes the proof. □ 


The semi-smooth Newton method for solving ([2]), with starting point x^ G M”, is given by 


x^+^ = - 


([Q - I] P(x^) + l) ^6, fc = 0,l,.... 


( 22 ) 


Remark 1. If Q — \ is a nonsingular matrix, T = [Q — \\ ^ and b = —Tb, then ([2]) and ([T]) are 
equivalent. Moreover, (l22]) becomes 


x^+^ = 


T-^P{x^) + l 


-1 


T-^b = 


Pix^)+T 


-1 


b, 


k = 0,l,..., 


which is the semi-smooth Newton method defined in m- 

Proposition 5. Let A G M. If \\Q — I|| < A < 1 then (l2|) has a unique solution. 


Proof. The proof follows by combining Remark [T] with Proposition [TJ 


□ 


The next result shows that the semi-smooth Newton defined in (j22p is always well defined. 

Lemma 3. Let x G M"". The following matrix is nonsingular 

[Q - I] P{x) + I. (23) 

As a consequence, the semi-smooth Newton sequence {x^} is well-defined, for any starting point 
x° G M”. 


Proof. The proof of the first part of the lemma, follows similar argument to the proof of Lemma 5 
of |17] . To prove the second part of the lemma, combine the dehnition of {x^} in (j22l) and the first 
part of the lemma. □ 






Proposition 6. If in ()22p it happens that sgn{{x^~^^)~^) = sgn{{x^)~^), then x^~^^ is a solution of 

m- 

Proof. The proof follows combining Remark [1] and Proposition [3l □ 

Proposition 7. The sequence {x^}, defined in is bounded from any starting point. Moreover, 
for each accumulation point x of {x’^}, there exists an x € ffi"" such that 

{[Q-l]P{x)+l)x = -b. (24) 

In particular, if sgn{x~^) = sgn{x~^) then x is a solution of ([2|). 

Proof. Using Remark [1] and Proposition [2] the result follows. □ 

Theorem 4. The sequences {x^} generated by the semi-smooth Newton Method (|22p for solving 
(El), is well defined for any starting point x^ G M”. Moreover, if 

||Q-I||<l/2, (25) 

then the sequence {x^f converges Q-linearly to x* G M”, the unique solution of ([2|), as follows 

||x*-x"+^|| < A; = 0,1,..., (26) 

and (x*)"*" is a solution of (ED- 

Proof. The well-definedness, for any starting point x^ G M”, follows from Lemma El For concluding 
the proof combine. Proposition U Remark [1] and Theorem [2l □ 

Note that ([2^ implies that the eigenvalues of Q belong to (0,^) U (^,1). Let us present an 
important equivalent form of problem (ED- 

Example 5. Given A G a nonsingular matrix, ;= {Ax : x G M”} and z G M"'. The 

projection Par^ {z) of the point z onto the cone is defined by 

PARliz) := axgmm i^^\\z - yf : y G . 

From the definition of the simplical cone associated with the matrix A, the problem of projecting 
the point z G M” onto a simplicial cone may be stated as the following positively constrained 

quadratic programming problem 

Minimize —||z —Ax|p, 
subject to X G K” . 

Hence, if v & M”’ is the unique solution of this problem then we have PAR^iz) = Av. The above 
problem is equivalent to the following nonegatively constrained quadratic programming problem 

Minimize -x~^Qx -\- x^b -\- c (27) 

subject to X G M”, 

by taking Q = A~^A, b = —A~^z and c = z^zj^. The optimality condition for problem (1771) implies 
that its solution can be obtained by solving the following linear complementarity problem 

y — Qx = b, X > 0, y > 0, (x, y) = 0. (28) 
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Remark 2. It is easy to establish that corresponding to each nonnegative quadratic problems (I27p 
and each linear complementarity problems (l28|) associated to positive definite matrices, there are 
equivalent problems of projection onto simplicial cones. Therefore, the problem of projecting onto 
simplicial cones can be solved by active set methods f^25W20l[2^ or any algorithms for solving LCPs, 
see e.g., and special methods based on its geometry, see e.g., Other fashionable ways 

to solve this problem are based on the classical von Neumann algorithm (see e.g., the Dykstra 
algorithm Nevertheless, these methods are also quite expensive (see the numerical 

results in JMEj and the remark preceding section 6.3 in m) 


4 Computational results 

In this section we test our semi-smooth Newton method (j22[) to find solutions on generated random 
instances of (l2|). We present two types of experiments. In one of them, we guarantee that for each 
test problem the hypotheses given in Theorem [4] are satisfied and in the other they are not. 

All programs were implemented in MATLAB Version 7.11 64-bit and run on a 2>.AQGHz Intel 
Core f5 — 4670 with 8.0GB of RAM. All MATLAB codes and generated data of this paper are 
available in http://orizon.mat.ufg.br/pages/34449-publications. 

All experiments are based on the following general considerations: 

• In order to accurately measure the method’s runtime for a problem, each one of the test prob¬ 
lems was solved 10 times and the runtime data collected. Then, we defined the corresponding 
method’s runtime for a problem as the median of these measurements. 

• Let TolV G M+ be a relative bound, we consider that the method converged to the solution 
and stopped the execution when, for some k, the condition 

||u-x'=|| < TolA(l + ||u||), 

is satisfied. If the previous stopping criteria are not met within 100 iterations, we declare that 
the method did not converge. 

4.1 When the hypotheses of Theorem [4] are satisfied 

In this experiment, we studied the behavior of the method on sets of 100 randomly generated 
test problems of dimension n = 2000, 3000,4000, 5000, respectively. Furthermore, we analyzed the 
influence of the initial point in the convergence of the method on 1000 randomly generated test 
problems of dimension n = 100. For each test problem in this experiment the hypotheses given in 
the Theorem H] are satisfied, generating each of them as follows: 

1. To construct the matrix Q G symmetric and positive definite satisfying the assumption 

(j25p in Theorem 01 we first chose a random number fi from the standard uniform distribution 
on the open interval (0,1/2). Secondly, we compute the singular value decomposition 
of a symmetric and positive definite matrix of the form where B is a generated n x n 

real nonsingular matrix containing random values drawn from the uniform distribution on the 
interval [—10®, 10®]. Finally, in the present case the equality V = U holds and we compute 
the matrix Q from 

Q = U [1 + 
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where a is the largest singular value of S. It is important to note that by construction of the 
matrix Q always = \\Q — I||. 

2. We have chosen the solution u € M"' containing random values drawn from the uniform 
distribution on the interval [—10®, 10®] and then we have computed b G M"" from equation (j^ . 

3. Finally we have chosen a starting point x® G R” containing random values drawn from the 
uniform distribution on the interval [—10®, 10®]. 

In accordance with the theoretical convergence of the method, ensured by Theorem [U the com¬ 
putational convergence is obtained in all cases. 

The computational results to analyze the behavior of the method on sets of 100 generated random 
test problems of different dimensions, are reported in Table [TJ Prom these, it can be noted that 
for the same dimension, to achieve higher accuracy, the method does not experience a signihcant 
increase in the number of iterations or in runtime. On the other hand, the increase in the dimension 
of the problems does not necessarily involve an increase in the number of iterations to achieve the 
same accuracy, however, a larger runtime is consumed. A larger runtime consumption is associated 
with the fact that the semi-smooth Newton method (|22p requires the solution of a linear system in 
each iteration, whose computational effort increases with the dimension of the problem. Another 
important aspect that can be checked in Table [1] is the ability of the method to converge in about 
three iterations on average. 


n 

Total Iterations 

Total Time 

2000 

278 

294 

296 

142.48 

147.77 

148.05 

3000 

282 

295 

299 

445.61 

465.48 

471.65 

4000 

278 

297 

300 

1013.79 

1082.43 

1093.55 

5000 

285 

303 

307 

1945.23 

2067.08 

2112.72 

TolA 

10“® 

10"® 

10-10 

10"® 

10"® 

10-10 


Table 1; Total overall iterations and total time in seconds, performed and consumed, respectively 
by the semi-smooth Newton method (I22|) to solve the 100 test problems of each dimension for 
different accuracies. 

In order to study the influence of the initial point in the convergence of the method, we have 
generated 1000 test problems of dimension n = 100 and we have associated to each of them 1000 
generated initial points. We have solved each problem with the 1000 corresponding initial points. 
Then, we have computed the standard deviation (STD) di and the mean value (MEAN) m, of the 
number of iterations performed by the method to solve the problem i taking each one of the 1000 
initial points. Finally we have computed the mean of all di and the mean of all rnt, i = 1 ,..., 1000. 
All cases converged, indicating robustness of the method with respect to the starting point. The 
results are shown in Table [2l The standard deviation of the number of iterations performed by the 
method to solve the problem i with the 1000 initial points gives us an idea of the influence of the 
initial point in the number of iterations performed by the method in each problem. The reported 
means of these standard deviation values give us an idea of the influence of the initial point in the 
number of iterations performed by the method in all the problems in general. The results in the 
table show that on average the number of iterations performed by our method to find the solution 
for a problem varies only very slightly with the chosen starting point. Again we see that the average 
number of iterations performed is less than three. 
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Tol X 

MEAN ({di}i=i,...,iooo) 

MEAN ({mi}j=i^...4ooo) 

o 

1 

0.2450 

2.3331 

10-8 

0.2530 

2.3454 

10-10 

0.2536 

2.3457 


Table 2: Influence of the initial point in the convergence of the semi-smooth Newton method (1221) 
on a total of 1000 test problems of dimension n = 100 each of them with 1000 generated initial 
points for different accuracies. 

4.2 When the hypotheses of Theorem [4] are not satisfied 

In this experiment, we studied the behavior of the method on 1000 test problems of dimension 
n = 1000, where the hypotheses given in the Theorem 0] are not all satisfied. 

In this case, the test problems were built almost as in the previous experiment. The only difference 
was in the construction of the matrix Q G not satisfying the assumption (|25p of Theorem [H 

Namely, we chose the random number /3 from the standard uniform distribution on the interval 
[lb,uh), where ^ < lb < ub. 

According to the obtained numerical results, we can conjecture that our method converges to a 
much broader class of problems, not satisfying the hypotheses of Theorem U) However we detected 
that convergence with high accuracy to the solution largely depends on the magnitude of the value 
of the norm in condition (|25l) . This idea can be observed inspecting Table [3l As the magnitude of 
the value of the norm in (I25p increases sufficiently, the number of problems for which the method 
converges to the solution with greater accuracy decreases. This phenomenon, of course, is not as¬ 
sociated to the convergence of the method for a specific problem, but, rather, there is an optimum 
accuracy achievable due to the accumulated errors. Small tolerances do not ensure obtaining ac¬ 
curate results. It can be the case that convergence is overlooked and unnecessary iterations are 
performed. It is important to note in the table that, even when the hypothesis is unfulfilled, the 
method converges for these problems, however it can be noted that the number of iterations per¬ 
formed by the method increases with respect of the previous experiments in which the hypotheses 
were fulfilled. 


13 € [lb, ub) 

Solved Problems 

Iterations 

[0.5,108) 

1000 

1000 

1000 

7.2160 

7.2190 

7.2190 

[108,10^) 

1000 

1000 

1000 

9.1800 

9.1850 

9.1850 

[10^108) 

1000 

1000 

1000 

9.6730 

9.6760 

9.6760 

[108,10®) 

1000 

1000 

693 

10.2820 

10.2860 

10.2540 

[10®, 10^) 

1000 

999 

0 

10.3870 

10.3874 

- 

[10^10®) 

998 

690 

0 

10.4339 

10.4246 

- 

Tol X 

10“® 

10-8 

10-10 

10-® 

10-8 

10-10 


Table 3: Number of problems solved by the semi-smooth Newton method (I22p on a total of 1000 
test problems of dimension n = 1000 of each condition {lb < fi < ub) for different accuracies, and 
the mean number of iterations performed by the semi-smooth Newton method (1221) to solve one 
problem in each case. 
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5 Conclusions 


In this paper we studied a special class of convex quadratic programming under positive constraint, 
which, via its optimality conditions, is reduced to finding the unique solution of a nonsmooth 
system of equations. Our main result shows that, under a mild assumption on the simplicial cone, 
we can apply a semi-smooth Newton method for finding a unique solution of the obtained associated 
nonsmooth system of equations and that the generated sequence converges linearly to the solution 
for any starting point. It would be interesting to see whether the used technique can be applied for 
solving more general convex programs. 

Since the optimality condition of a positive constrained convex quadratic programming problem 
is equivalent to a linear complementarity problem, which is equivalent to the problem of hnding the 
unique solution of a nonsmooth system of equations, another interesting problem to address is to 
compare our semi-smooth Newton method with active set methods [5l l25[[26ll29j . 

This paper is a continuation of where we studied the problem of projection onto a simplicial 
cone by using a semi-smooth Newton method. We expect that the results of this paper become a 
further step towards solving general convex optimization problems. We foresee further progress in 
this topic in the near future. 
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